SOME NOTES FOR SIMPLE PAVLOVIAN LEARNING


by A. S. Householder


In the accompanying figures are exhibited some nets representing some
of the qualitative features of the simplest types of Pavlovian reflex
learning. It is perhaps clear how, by inserting internuncials and adding
circuits, the quantitative characteristics can be adjusted. Admittedly a
great many such additions might be required before achieving any degree of
realism.

Fig. 1 represents the simplest form of conditioning with extinction.

Here and hereafter the threshold is 1 or irrelevant unless inserted; and S
and S' denote the unconditioned and conditioned stimuli, respectively. In
this net the occurrence of the conjunction S'(t) . S(t + 1) activates the
cycle, after which S'(τ) implies R(τ + 2). However, any occurrence of
S'(t) . ~S(t + 1), that is, of S' not followed by S, inactivates the cycle.
Thus a single trial establishes conditioning, but a single failure to
reinforce will extinguish it.
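
The behavior described for Fig. 1 can be traced step by step. The sketch
below is a behavioral simulation only, not a reconstruction of the net in
the figure. The function name simulate_fig1, the 0/1 list encoding of the
stimuli, and the rule that the response at τ + 2 depends on the state of the
cycle at time τ are assumptions made for illustration.

```python
def simulate_fig1(cs, us):
    """Trace conditioned responses for the one-cycle scheme described for Fig. 1.

    cs[t] and us[t] are 0/1 flags for the conditioned stimulus S' and the
    unconditioned stimulus S at discrete time t. Returns a 0/1 list r where
    r[t] == 1 marks a conditioned response R at time t.
    """
    steps = len(cs)
    r = [0] * (steps + 2)   # room for responses two steps after the last input
    active = False          # state of the cycle (assumed inactive at the start)
    for t in range(steps):
        if cs[t] and active:
            r[t + 2] = 1    # S'(t) with the cycle active gives R(t + 2)
        if cs[t] and t + 1 < steps:
            # S'(t) followed by S(t + 1) activates the cycle;
            # S'(t) not followed by S(t + 1) inactivates it.
            active = bool(us[t + 1])
    return r


# One reinforced trial at t = 0, then two unreinforced presentations of S'.
cs = [1, 0, 0, 1, 0, 0, 1, 0, 0]
us = [0, 1, 0, 0, 0, 0, 0, 0, 0]
print(simulate_fig1(cs, us))  # [0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0]
```

The single response at time 5 follows the presentation at t = 3, which is
itself unreinforced, so the presentation at t = 6 produces nothing.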

Fig. 2 shows one step in the elaboration. Here one trial activates
C1 but does not yet set up the conditioning; a second trial is required, at
which time C2 is activated and S' (via an internuncial for timing purposes)
is made liminal. A single nonreinforcement will inactivate C2; a second will
inactivate also C1.

If C1, instead of C2, were connected to R, then a single trial would
establish conditioning, but unstably, in the sense that a single nonreinforcement
would extinguish it. Two consecutive trials would increase the stability,
necessitating two consecutive nonreinforcements to extinguish it. This is
readily generalized by taking n cycles in stages, of which the m-th is
connected to R. Then conditioning would be established after m trials,
but each further trial increases the stability, up to the number n - m
of additional stages.
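
The generalization to staged cycles can be illustrated at the level of
whole trials. The sketch below assumes that each reinforced trial activates
the next inactive stage and each nonreinforced trial inactivates the last
active one, which is how the two-stage case is described above; the name
run_trials and the boolean trial encoding are hypothetical.

```python
def run_trials(trials, n, m):
    """Trial-level model of n cycles in stages, the m-th connected to R.

    trials is a sequence of booleans: True for a reinforced presentation of
    S', False for a nonreinforced one. Returns, for each presentation,
    whether a conditioned response occurs, that is, whether stage m was
    already active when S' was presented.
    """
    if not 1 <= m <= n:
        raise ValueError("need 1 <= m <= n")
    active_stages = 0          # stages are assumed to fill in order
    responses = []
    for reinforced in trials:
        responses.append(active_stages >= m)
        if reinforced:
            active_stages = min(n, active_stages + 1)
        else:
            active_stages = max(0, active_stages - 1)
    return responses


# Four reinforced trials followed by four nonreinforced ones, n = 4, m = 2.
print(run_trials([True] * 4 + [False] * 4, n=4, m=2))
# [False, False, True, True, True, True, True, False]
```

With n = 4 and m = 2 the response first appears on the third presentation,
and after four reinforcements it takes n - m + 1 = 3 nonreinforcements
before a presentation goes unanswered.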

Fig. 3 represents the first aspect of the delayed reflex, with one-stage
conditioning. Note that S(t) always implies R(t + 1). Now the
occurrence of S'(t) . S(t + 1) activates C1 as in Fig. 1, so that subsequent
occurrence of S'(τ) implies R(τ + 2). However, if S'(t) . ~S(t + 1), then the
cycle is inactivated and the conditioning extinguished. Also the occurrence of
S'(t) . S(t + 2) activates C2, so that subsequent occurrence of S'(τ) implies
R(τ + 3).

The most interesting feature of the delayed reflex, however, in
Pavlov's description is this. Continued reinforcement of the delayed
reflex (corresponding to activation of C2 and subsequent stages) has the
effect that the time of occurrence of R moves progressively forward;
nonreinforcement first increases the delay before continued nonreinforcement
finally extinguishes the conditioning.

To achieve this in its most elementary form, add another stage to C2
as in Fig. 2, but connect this cycle to I2 with an inhibitor and to I1 with
an excitor. Then S'(t) is sufficient to produce R(t + 2) if either this
new cycle is active, or if C1 is active, whereas if C2 alone is active, then
S'(t) will produce R(t + 3).
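
The dependence of the delay on the active cycles can be written as a small
decision rule. This is a sketch under assumptions: the cycles of Fig. 3 are
read as C1 and C2, the added cycle is given the hypothetical name C3, and
only the earliest response is reported, since the text does not say whether
a later response also occurs when several cycles are active.

```python
def response_delay(c1_active, c2_active, c3_active):
    """Delay from S'(t) to R under the delayed-reflex scheme described above.

    c1_active, c2_active: states of the two cycles of Fig. 3 (assumed to be
    C1 and C2). c3_active: state of the added cycle (hypothetical name C3),
    which excites I1 and inhibits I2.
    Returns 2 or 3, or None when S' alone produces no response.
    """
    if c1_active or c3_active:
        return 2      # R(t + 2): short-delay path through I1
    if c2_active:
        return 3      # R(t + 3): only the delayed path through I2 is open
    return None       # no cycle active, so S' is not yet effective


# Delayed conditioning first gives R(t + 3); continued reinforcement
# (activating the added cycle) moves the response forward to R(t + 2).
print(response_delay(False, True, False))  # 3
print(response_delay(False, True, True))   # 2
```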

Fig. 4 gives the phenomenon of generalization. It is supposed that
S1' and S2' are affected by different degrees of intensity, or different
qualities of the same type of stimulus (e.g., distinct tones). Either
conjunction, S1'(t) . S(t + 1) or S2'(t) . S(t + 1), is sufficient to activate
both cycles C1 and C2, after which the occurrence of either S1'(τ) or of
S2'(τ) is sufficient for R(τ + 2). However, the nonreinforced presentation
of S1' alone, that is to say S1'(τ) . ~S(τ + 1), inactivates C1, but not C2.

A more realistic representation requires obviously more than two degrees
or qualities, but should have also the following features: Suppose the
afferents are S1', ..., Sn', in the order of degrees of intensity or of a
simple ordering of the qualities. Progressive reinforcements of a particular
Sα' should establish Sα' sooner than other Sβ' (and for establishing Sβ'
by reinforcing Sα', the required number of reinforcements should increase
with |β - α|). Conversely, in the subsequent extinction of a given Sβ', more
nonreinforcements should be required for small |β - α| than for large.
Again, as extinction proceeds by nonreinforcement of Sβ', extinction of any
Sγ' should proceed at about the same relative rate when β lies between γ and α;
whereas when γ lies between β and α the rate at which Sγ' is extinguished
should depend upon the relative values of |β - γ| and |γ - α|.
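
The first two of these requirements can be made concrete with a toy
gradient. The linear rule below is purely an assumption chosen for
illustration and is not derived from any net in the figures; the function
names are hypothetical, and the third requirement (relative extinction
rates for an intermediate γ) is not modeled.

```python
def strength(alpha, beta, reinforcements):
    """Hypothetical strength of Sβ' after reinforcing Sα' a number of times.

    Assumes a linear falloff: each unit of distance |β - α| costs one
    reinforcement before Sβ' begins to gain strength.
    """
    return max(0, reinforcements - abs(beta - alpha))


def reinforcements_to_establish(alpha, beta):
    """Reinforcements of Sα' needed before Sβ' has strength at least 1."""
    return 1 + abs(beta - alpha)


def nonreinforcements_to_extinguish(alpha, beta, reinforcements):
    """Nonreinforcements of Sβ' needed to bring its strength back to 0,
    assuming each nonreinforcement removes one unit of strength."""
    return strength(alpha, beta, reinforcements)


alpha = 3
for beta in (3, 4, 5):
    print(beta,
          reinforcements_to_establish(alpha, beta),
          nonreinforcements_to_extinguish(alpha, beta, reinforcements=4))
# 3 1 4
# 4 2 3
# 5 3 2
```

Under this rule the number of reinforcements needed grows with |β - α|,
and after a fixed amount of training the nearer stimuli take more
nonreinforcements to extinguish, as the text requires.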